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METHOD FOR PREPARING POLYFUNCTIONALIZED PEPTIDES AND/OR 
PROTEINS VIA NATIVE CHEMICAL LIGATION 


Related Applications 
[0001] This application is related to provisional application No.: 60/500,708, filed 

September 5, 2003;. which is hereby incorporated by reference in its entirety. 

Government Support 
[0002] The invention was supported in part by Grant Nos.: CAl 03823 (formerly 

AI16943), CA62948 and AI051883 from the National Institutes of Health; and PC020147 from 
the U.S. Army Prostate Cancer Research Program. The U.S. government may have certain rights 
in this invention. 

Background of the Invention 
[0003] Glycoproteins arc important biomacromolecules that are biosynthesized through 

posttranslational glycosylation of newly fashioned proteins emerging from the ribosome. 
Interest in glycoproteins arises at many levels. A long-term goal of the growing field of 
chemistry-based glycobiology is the delineation of the consequences of glycosylation on critical 
properties such as protein folding, proteolytic stability, and cell adhesion.' Such insights could 
explain why nature bothers to glycosylate otherwise fimctional proteins. Moreover, 
glycoproteins have potentially important clinical roles in the contexts of vaccines, diagnostics, 
and therapeutics. Indeed, erythropoietin (1), albeit a heterogeneous glycoprotein,' is clinically 
valuable as a treatment for anemia, among other indications.** 

[0004] Many naturally occurring, medicinally important glycoproteins (c/, for example, 

erythropoietin*" and gpl20'') display multiple glycosylation sites containing large 
oUgosacchaiide domains. However, given the complexity and variability of biological 
glycosylation pathways,*" the isolation of homogeneous glycoproteins from natural sources in 
significant quantity is extremely difficult 

[0005] Numerous methods exist for the production of glycopeptides by chemical 

synthesis. For example, glycans have been introduced into peptides via amino acid "cassettes" 
with pendant protected saccharides,""^* through enzymatic manipulations of glycopeptides,'"* 
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or by conjugation of fully elaborated, complex saccharides to short synthetic peptides.^*^' 
Larger 0-linked glycopeptides have been synthesized using ligation techniques*^^^ such as 
expressed protein ligation.^^ Bertozzi and coworkers extended the scope of the "cassette" 
approach by applying native chemical ligation to the synthesis of a biologically active 
glycoprotein with two single-residue O-linked glycans*^ Tolbert and Wong described the 
ligation of a 392-residue intein-generated peptide thioester and a dipeptide fimctionalized with a 
single iV-acetylglucosamine residue. However, none of these approaches has allowed the 
assembly of complex glypeptides or glycoproteins multiply fimctionalized (e.g.. multiply 
glycosylated) at designated sites. 

[0006] Accordmgly, there remains a need for novel synthetic methods leading to the 

preparation of homogeneous polyfimctionalized peptides and/or proteins. Specifically, 
convergent, stereoselective, versatile methods for preparing such glycopeptides and/or 
glycoproteins are needed. 

Summary of the Invention 
[00071 In recognition of the need to provide access to polyfimctionalized peptides and 

proteins, the present invention, in one aspect, provides a system for preparing a 
polyfimctionalized peptide comprising a peptidic backbone made up of four or more amino 
acids, wherein two or more non-adjacent amino acids are mdependently substituted with a 
moiety having the structure: 

wherein each occurrence of is independently a substituted or unsubstituted, linear or 
branched, cyclic or acyclic, saturated or unsaturated aliphatic or heteroaliphatic moiety; 

each occurrence of A is independently an aliphatic, heteroaliphatic, aryl, heteroaryl or a 
pharmaceutically usefiil group or entity; 

with the proviso that the peptide sequence between any two consecutive, non-adjacent, 
amino acids bearing a A-L'- moiety comprises at least one cysteine residue. 
[0008] In certain embodiments, each occurrence of A is independently a 

pharmaceutically usefiil group or entity. In certain embodiments, each occurrence of A is 
independently a biomolecule, a small molecule, a macromoleculc or a diagnostic label. 
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[0009] In certain exemplary embodiments, each occurrence of A is independently a 

carbohydrate determinant having the structure: 



wherein a, b, c, d, e, f,,g, h, i, x, y and z are independently 0, 1, 2 or 3, with the proviso 
that the x, y and z bracketed structures represent furanose or pyranose moieties and the sum of b 
andcis 1 or 2, the sum ofd and fisl or 2, and the sum of g and i is 1 or 2, and with the proviso 
that X, y and z are not simultaneously 0; wherein Ro is hydrogen, a linear or branched chain 
alkyl, acyl, arylalkyl or aryl group; wherein each occurrence of Ri, Rj, R3. R4, Rs. R« , R7, Rs and 
R9 is independently hydrogen, OH, OR', NHR', NHCOR', F. CH2OH, CH2OR', a substituted or 
unsubstituted linear or branched chain alkyl, (mono-, di- or tri)hydroxyalkyl, (mono-, di- or 
tri)acyloxyalkyl, arylalkyl or aryl group; wherein each occurrence of R' is independently 
hydrogen, CHO, COOR", or a substituted or unsubstituted Unear or branched chain alkyl, acyl, 
arylalkyl or aryl group or a saccharide moiety having the structure: 


R12 



wherein Y and Z are mdependently NH or O; wherein k, 1, r, s, t, u, v and w are each 
independently 0, 1 or 2; with the proviso that the v and w bracketed structures represent furanose 
or pyranose moieties and the sum of 1 and k is 1 or 2, and the sum of s and u is 1 or 2, and with 
the proviso that v and w are not simultaneously 0; wherein R'o is hydrogen, a Imear or branched 
chain alkyl, acyl, arylalkyl or aryl group; wherein each occurrence of Rio, Rii, Ri2, R13. Ri4 and 
Ri5 is independently hydrogen, OH, OR"', NHR»'. NHCOR"', F, CH2OH, CHjOR"', or a 
substituted or unsubstituted linear or branched chain alkyl, (mono-, di- or tri)hydroxyalkyl, 
(mono-, di- or tri)acyloxyalkyl, arylalkyl or aryl group; wherein each occurrence of R16 is 
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hydrogen, COOH, COOR", CONHR", a substituted or unsubstituted linear or branched chain 
alkyl or aryl group; wherein each occurrence of R'" is hydrogen, CHO, C00R'\ or a substituted 
or unsubstituted linear or branched chain alkyl, acyl, arylalkyl or aryl group; and wherein each 
occurrence of R'' and R** ai« each independently H, or a substituted or unsubstituted linear or 
branched chain alkyl, arylalkyl or aryl group. 

[0010] In certain embodiments, the present invention provides a method for preparing a 

polyfunctionalized peptide having the structure: 



si and s2 are indq)endently an integer from 1 to about 20; 
tl, t2 and t3 are each independently an integer; 

R''* is hydrogen, alkyl, acyl, aryl, heteroaryl, -alkyl(aryl), -alkyl(heteroaryl), a nitrogen 
protecting group, an amino acid or a proctected amino acid; 

r'" is -or'"" or -NR'"'*R'°% wherein R'^ is hydrogen, alkyl, aryl, heteroaryl, - 
alkyl(aryl), -alkyl(heteroaryl), a carboxylic acid protecting group, an amino acid or a proctected 
amino acid; and R'"* and R'"" are independently hydrogen, alkyl, aryl, heteroaryl, -alkyl(aryl), - 
alkyl(heteroaryl), a nitrogen protecting group, an amino acid or a proctected amino acid; 

R***, R*^ and R" are independently H, alkyl. heteroalkyl, aryl, heteroaryl, -alkyl(aryl), - 
alkyl(heteroaryl), or a natural or non-natural amino acid side chain; 

each occurrence of is independently a substituted or imsubstituted aliphatic or 
heteroaliphatic moiety; 

Ai and A2 are each independently an aliphatic, heteroaliphatic, aryl, heteroaryl or a 
pharmaceutically useful group or entity; and 

at least one occurrence of the bracketed structure t2 is a cysteine residue or protected 
cysteine residue; 

wherein the method comprise a step of: 


Page 4 of 56 

Express MaU No.: EV 170393236 US " 
3665577v2 


Attorney Docket No.: 2003080-0149 
Client Reference No.: SK-1 156-PRO 


reacting a peptide acyl donor having the structure: 



ti si t 


vdth a peptide amine acceptor having the structure: 



under suitable conditions to effect disulfide exchange ligation; 

wherein the sum t+t' equals (t2)+l; R^' is a sulfide protecting group; and R'^" is a group 
such that the moiety -C(=0)SR^*' can be made to undergo disulfide exchange with the 
glycopeptide amine acceptor. 

[00111 In certain exemplary embodiments, Ai and A2 are each independentiy a 

carbohydrate domain as defined for A above. 

[0012] In certain embodiments, the step may be carried out once, or repeated a desired 

number of times, to prepare the polyfimctionalized peptide having the structure: 



to s t_ 


wherein R'^' and R^ are as defined above; 
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each occurrence of A may be the same or different and may be as defined for Ai and A2 

above; 

each occurrence of R^' may be the same or dififerent and may be as defined for R^' and 
r"" above; 

q is an integer greater than or equal to 2; 

each occurrence of s is independently an integer firom 1 to about 20; 
each occurrence of t is independently an integer; 
to is an integer; and 

each occurrence of R™ is independently H, alkyl, heteroalkyl, aryl, heteroaryl, - 
alkyl(aryl), -alkyl(heteroaryl), or a natural or non-natural amino acid side chain. 

Definitions 

[0013] Certain specific fimctional groups defined in the inventive method are described 

in more detail below. For purposes of this invention, the chemical elements are identified in 
accordance with the Periodic Table of the Elements, CAS version. Handbook of Chemistry and 
Physics, 75* Ed., inside cover, and specific functional groups are defined as described therein. 
Additionally, general principles of organic chemistry, as well as specific functional moieties and 
reactivity, are described in "Organic Chemistry", Thomas Sorrell, University Science Books, 
Sausalito: 1999, the entire contents of which are incorporated herem by reference. 
10014) It will be appreciated that the compounds, as described herein, may be substituted 

with any number of substituents or functional moieties. In general, the term "substituted" 
whether preceded by the term "optionally" or not, and substituents contained in formulas of this 
invention, refer to the replacement of hydrogen radicals in a given structure with the radical of a 
specified substituent. When more than one position in any given structure may be substituted 
with more than one substituent selected fi»m a specified group, the substituent may be either the 
same or different at every position unless otherwise indicated. As used herein, the term 
"substituted" is contemplated to include all permissible substituents of organic compounds. In a 
broad aspect, the permissible substituents include acyclic and cyclic, branched and unbranched, 
carbocyclic and heterocyclic, aromatic and nonaromatic substituents of organic compounds. For 
purposes of this invention, heteroatoms such as nitrogen may have hydrogen substituents and/or 
any permissible substituents of organic compounds described herein which satisfy the valencies 
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of the heteroatoms. Furthermore, this invention is not intended to be limited in any manner by 
the permissible substituents of organic compounds. Combinations of substituents and variables 
envisioned by this invention are preferably those that result in the formation of stable compounds 
useful in the treatment and/or prevention of HIV, or in the inducement of antibodies, as 
described herein. The term "stable", as used herein, preferably refers to compounds which 
possess stability sufficient to allow manufacture and which maintain the integrity of the 
compound for a sufficient period of time to be useful for the purposes detailed herein. 
[0015] The term "aliphatic", as used herein, includes both saturated and unsaturated, 

straight chain (/.e., unbranched) or branched aliphatic hydrocarbons, which are optionally 
substituted with one or more fimctional groups. As will be appreciated by one of ordinary skiU 
in the art, "aliphatic" is intended herein to include, but is not limited to, alkyl, alkenyl, alkynyl 
moieties. Thus, as used herein, the term "alkyl" includes straight and branched alkyl groups. An 
analogous convention applies to other generic terms such as "alkenyl", "alkynyl" and the like. 
Furdiermore, as used herein, the terms "alkyl", "alkenyl", "alkynyl" and the like encompass both 
substituted and unsubstituted groups. In certain embodiments, as used herein, "lower alkyl" is 
used to indicate those alkyl groups (cyclic, acyclic, substituted, unsubstituted, branched or 
unbranched) having 1-6 carbon atoms. 

[00161 In certain embodiments, the alkyl, alkenyl and alkynyl groups employed in the 

invention contain 1-20 aliphatic carbon atoms. In certain other embodiments, the alkyl, alkenyl, 
and alkynyl groups employed in the invention contain 1-10 aliphatic carbon atoms. In yet other 
embodiments, the alkyl, alkenyl, and alkynyl groups employed in tiie invention contain 1-8 
aliphatic carbon atoms. In still other embodiments, the alkyl, alkenyl, and alkynyl groups 
employed in the invention contain 1-6 aliphatic carbon atoms. In yet other embodiments, the 
alkyl, alkenyl, and alkynyl groups employed in the invention contain 1-4 carbon atoms. 
Illustrative aliphatic groups tiius include, but are not limited to, for example, methyl, ethyl, n- 
propyl, isopropyl, allyl, n-butyl, sec-butyl, isobutyl. tert-butyl, n-pentyl, sec-pentyl, isopentyl, 
tert-pentyl, n-hexyl, sec-hexyl, moieties and the like, which again, may bear one or more 
substitiients. Alkenyl groups include, but are not limited to, for example, ethenyl, propenyl, 
butenyl, l-metiiyl-2-buten-l-yl, and the like. Representative alkynyl groups include, but are not 
limited to, ethynyl, 2-propynyl (propargyl), 1-propynyl and the like. 
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[0017] The term "alicyclic", as used herein, refers to compounds which combine the 

properties of aliphatic and cyclic compounds and include but are not limited to cyclic, or 
polycyclic aliphatic hydrocarbons and bridged cycloalkyl compounds, which are optionally 
substituted with one or more functional groups. As will be appreciated by one of ordinary skill 
in the art. "alicyclic'* is intended herein to include, but is not limited to, cycloalkyl, cycloalkenyl, 
and cycloalkynyl moieties, which are optionally substituted with one or more functional groups. 
Illustrative alicyclic groups thus include, but are not limited to, for example, cyclopropyl, -CH2- 
cyclopropyl, cyclobutyl, -CH2-cyclobutyl, cyclopentyl, -CHi-cyclopentyl-n, cyclohexyl, -CH2- 
cyclohexyl, cyclohexenylethyl, cyclohexanylethyl, norborbyl moieties and the like, which again, 
may bear one or more substituents. 

[0018] The term "alkoxy" (or "alkyloxy"), or "thioalkyl*' as used herein refers to an alkyl 

group, as previously defined, attached to the parent molecular moiety through an oxygen atom or 
through a sulfur atom. In certain embodiments, the alkyl group contains 1-20 aliphatic carbon 
atoms. In certain other embodiments, the alkyl group contains 1-10 aliphatic carbon atoms. In 
yet other embodiments, the alkyl, alkenyl, and alkynyl groups employed in the invention contain 
1-8 aliphatic carbon atoms. In stiU other embodiments, the alkyl group contains 1-6 aliphatic 
carbon atoms. In yet other embodiments, the alkyl group contains 1-4 aliphatic carbon atoms. 
Examples of alkoxy, include but are not limited to, methoxy, ethoxy, propoxy, isopropoxy, n- 
butoxy, tert-butoxy, neopentoxy and n-hexoxy. Examples of thioalkyl include, but are not 
limited to, methylthio, ethylthio, propylthio, isopropylthio, n-butylthio, and the like. 
[0019] The term "alkylamino" refers to a group having the structure -NHR'wherein R' is 

alkyl, as defined herein. The term "aminoalkyl" refers to a group having the structure. NH2R'-, 
wherein R' is alkyl, as defined herein. In certain embodiments, the alkyl group contains 1-20 
aliphatic carbon atoms. In certain other embodiments, the alkyl group contains 1-10 aliphatic 
carbon atoms. In yet other embodiments, the alkyl, alkenyl, and alkynyl groups employed in the 
invention contain 1-8 aUphatic carbon atoms. In still other embodiments, the alkyl group 
contains 1-6 aliphatic carbon atoms. In yet other embodiments, the alkyl group contains 1-4 
aliphatic carbon atoms. Examples of alkylamino include, but are not limited to, methylamino, 
ethylamino, iso-propylamino and the like. 

[0020] Some examples of substituents of the above-described aliphatic (and other) 

moieties of compounds of the invention include, but are not limited to aliphatic; heteroaliphatic; 
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aryl; heteroaryl; alkylaryl; alkylheteroaryl; alkoxy; aryloxy; heteroalkoxy; heteroaryloxy; 
alkylthio; arylthio; heteroalkylthio; heteroarylthio; F; CI; Br; I; -OH; -NO2; -CN; -CF3; - 
CH2CF3; -CHCI2; -CH2OH; -CH2CH2OH; -CH2NH2; -CH2SO2CH3; -C(0)Rx; -C02(Rx); - 
C0N(Rx)2; -OC(0)Rx; -OCCbRx; -OCON(Rx)2; -N(Rx)2; -S(0)2Rx; -NRx(CO)Rx wherein each 
occurrence of Rx independently includes, but is not limited to, aliphatic, heteroaliphatic, aryl, 
heteroaryl, alkylaryl, or alkylheteroaryl, wherein any of the aliphatic, heteroaliphatic, alkylaryl, 
or alkylheteroaryl substituents described, above and herein may be substituted or unsubstituted, 
branched or unbranched, cyclic or acyclic, and wherein any of the aryl or heteroaryl substituents 
described above and herein may be substituted or unsubstituted. Additional examples of 
generally applicable substituents are illustrated by the specific embodiments shown in the 
Examples that are described herein. 

[0021] In general, the terms "aryl" and "heteroaryl", as used herein, refer to stable mono- 

or polycyclic, heterocyclic, polycyclic, and polyheterocyclic unsaturated moieties having 
preferably 3-14 carbon atoms, each of which may be substiUited or unsubstituted. It will also be 
appreciated that aryl and heteroaryl moieties, as defined herein may be attached via an aliphatic, 
alicycUc, heteroaliphatic, heteroaHcyclic, alkyl or heteroalkyl moiety and thus also include - 
(aliphatic)aryl, -(heteroaIiphatic)aryl, -(aliphatic)heteroaryl, -(heteroaliphatic)heteroaryl, - 
(alkyl)aryl, -(heteroalkyl)aryl, -(heteroalkyl)aryl, and -(heteroalkyl)heteroaryl moieties. Thus, as 
used herein, the phrases "aryl or heteroaryl" and "aryl, heteroaryl, -(aliphatic)aryl, - 
(heteroaliphatic)aryl, -(aliphatic)heteroaryl, -(heteroaliphatic)heteroaryl, -(alkyl)aryl, - 
(heteroalkyl)aryl, -(heteroalkyl)aryl, and -(heteroalkyl)heteroaryl" are interchangeable. 
Substituents include, but are not limited to, any of the previously mentioned substitutents, i.e., 
the substituents recited for aliphatic moieties, or for other moieties as disclosed herein, resulting 
in the formation of a stable compound. In certain embodiments of the present invention, "aryl" 
refers to a mono- or bicyclic carbocyclic ring system having one or two aromatic rings including, 
but not limited to, phenyl, naphthyl, tetrahydronaphthyl, indanyl, indenyl and the like, hi certain 
embodiments of the present invention, the term "heteroaryl", as used herein, refers to a cyclic 
aromatic radical having from five to ten ring atoms of which one ring atom is selected from S, O 
and N; zero, one or two ring atoms are additional heteroatoms independently selected from S, O 
and N; and the remaining ring atoms are carbon, the radical being joined to the rest of the 
molecule via any of the ring atoms, such as, for example, pyridyl, pyrazinyl, pyrimidinyl, 
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pyrrolyl, pyrazolyl, imidazolyl, thiazolyl, oxazolyl, isooxazolyl, thiadiazolyl, oxadiazolyl, 
thiophenyl, furany 1 , quinolinyl, isoquinolinyl, and the like. 

[0022] It will be ^predated that aryl and heteroaryl groups (including bicyclic aryl 

groups) can be unsubstituted or substituted, wherein substitution includes replacement of one, 
two or three of the hydrogen atoms thereon independently with any one or more of the followmg 
moieties including, but not limited to: aliphatic; heteroaliphatic; aryl; heteroaryl; alkylaryl; 
alkylheteroaryl; alkoxy; aryloxy; heteroalkoxy; heteroaryloxy; alkylthio; arylthio; 
heteroalkylthio; heteroarylthio; F; CI; Br; I; -OH; -NO2; -CN; -CF3; -CH2CF3; -CHCh; - 
CH2OH; -CH2CH2OH; -CH2NH2; -CH2SO2CH3; -C(0)Rx; -C02(Rx); -CON(Rx)2; -OC(0)Rx; - 
OCO2RX; -0C0N(Rx)2; -N(Rx)2; -S(Q)2Rx; -NRx(CO)Rx wherein each occurrence of Rx 
independently includes, but is not limited to, aliphatic, heteroaliphatic, aryl, heteroaryl, alkylaryl, 
or alkylheteroaryl, wherein any of the aliphatic, heteroaliphatic, alkylaryl, or alkylheteroaryl 
substituents described above and herein may be substituted or unsubstituted, branched or 
unbranched, cyclic or acyclic, and wherein any of the aryl or heteroaryl substituents described 
above and herein may be substituted or unsubstituted. Additional examples of generally 
applicable substituents are illustrated by the specific embodiments shown in the Examples that 
are described herein. 

[0023] The term "cycloalkyl", as used herein, refers specifically to groups having three to 

seven, preferably three to ten carbon atoms. Suitable cycloalkyls include, but are not limited to 
cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, cycloheptyl and the like, which, as m the case 
of aliphatic, heteroaliphatic or heterocyclic moieties, may optionally be substituted with 
substituents including, but not limited to aliphatic; heteroaliphatic; aryl; heteroaryl; alkylaryl; 
alkylheteroaryl; alkoxy; aryloxy; heteroalkoxy; heteroaryloxy; alkylthio; arylthio; 
heteroalkylthio; heteroarylthio; F; CI; Br; I; -OH; -NO2; -CN; -CF3; -CH2CF3; -CHCI2; - 
CH2OH; .CH2CH2OH; -CH2NH2; -CH2SO2CH3; -C(0)Rx; -C02(Rx); -CON(R.)2; -OC(0)Rx; - 
OCO2RX; -OCON(Rx)2; -N(Rx)2; -S(0)2Rx; -NRx(CO)Rx wherein each occurrence of Rx 
independently includes, but is not lunited to, aliphatic, heteroaliphatic, aryl, heteroaryl, alkylaryl, 
or alkylheteroaryl, wherein any of the aliphatic, heteroaliphatic, alkylaryl, or alkylheteroaryl 
substituents described above and herem may be substituted or unsubstituted, branched or 
unbranched, cyclic or acyclic, and wherein any of the aryl or heteroaryl substituents described 
above and herein may be substituted or unsubstituted. Additional examples of generally 
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applicable substituents are illustrated by the specific embodiments shown in the Examples that 
are described herein. 

[0024] The term "heteroaliphatic", as used herein, refers to aliphatic moieties in which 

one or more carbon atoms in the main chain have been substituted with a heteroatom. Thus, a 
heteroaliphatic group refers to an aliphatic cham which contains one or more oxygen, sulfur, 
nitrogen, phosphorus or silicon atoms, e.g., in place of carbon atoms. Heteroaliphatic moieties 
may be branched or linear unbranched. In certain embodiments, heteroaliphatic moieties are 
substituted by independent replacement of one or more of the hydrogen atoms thereon with one 
or more moieties includmg, but not limited to aliphatic; alicyclic; heteroaliphatic; heteroalicyclic; 
aryl; heteroaryl; alkylaryl; alkylheteroaryl; alkoxy; aryloxy; heteroalkoxy; heteroaryloxy; 
alkylthio; arylthio; heteroalkylthio; heteroarylthio; F;. CI; Br; I; -OH; -NO2; -CN; -CF3; - 
CH2CF3; -CHCI2; -CH2OH; -CH2CH2OH; -CH2NH2; -CH2SO2CH3; -C(0)Rx; -C02(Rx); - 
CON(Rx)2; -OC(0)Rx; -OCOzRx; -OCON(Rx)2; -N(Rx)2; -S(0)2Rx; -NRx(CO)Rx wherein each 
occuirence of Rx independently includes, but is not limited to, aliphatic, alicyclic, heteroaliphatic, 
heteroalicyclic, aryl, heteroaryl, alkylaryl, or alkylheteroaryl, wherein any of the aliphatic, 
alicyclic, heteroaliphatic, heteroalicyclic, alkylaryl, or alkylheteroaryl substituents described 
above and herein may be substituted or unsubstituted, branched or unbranched, cyclic or acyclic, 
and wherein any of the aryl or heteroaryl substituents described above and herein may be 
substituted or unsubstituted. Additional examples of generally applicable substituents are 
illustrated by the specific embodiments shown in the Examples that are described herein. 
[0025] The term "heteroalicyclic", as used herein, refers to compounds which combine 

the properties of heteroaliphatic and cyclic compounds and include but are not limited to 
saturated and unsaturated mono- or polycyclic heterocycles such as morpholino, pyrrolidinyl, 
fiiranyl, thiofiiranyl, pyrrolyl etc., which are optionally substituted with one or more fimctional 
groups, as defined herein. 

[0026] Additionally, it will be appreciated that any of the alicyclic or heteroalicyclic 

moieties described above and herein may comprise an aryl or heteroaryl moiety fiised thereto. 
Additional examples of generally applicable substituents are Ulustrated by the specific 
embodiments shown in the Examples that are described herein. 

[0027] The terms "halo" and "halogen" as used herein refer to an atom selected &om 

fluorine, chlorine, bromine and iodine. 

Page 1 1 of 56 

Express Mail No.: EV 170393236 US Attorney Docket No.: 200308a«149 

^ Client Reference No.: SK-1 1 56-PRO 


366SS77v2 


{00281 The terni "haloalkyl" denotes an alkyl group, as defined above, having one, two, 

or three halogen atoms attached thereto and is exemplified by such groups as chloromethyl, 
bromoethyl, trifluoromethyl, and the like. 

[00291 The term "heterocycloalkyl" or "heterocycle", as used herein, refers to a non- 

aromatic 5-, 6- or 7- membered ring or a polycyclic group, including, but not limited to a bi- or 
tri-cyclic group comprising fiised six-membered rings having between one and three heteroatoms 
independently selected from oxygen, sulfiir and nitrogen, wherein (i) each 5-membered ring has 
0 to 1 double bonds and each 6-membered ring has 0 to 2 double bonds, (ii) the nitrogen and 
sulfur heteroatoms may be optionaUy be oxidized, (iii) the nitrogen heteroatom may optionally 
be quatemized, and (iv) any of the above heterocyclic rings may be fused to an aryl or heteroaryl 
ring. Representative heterocycles include, but are not linuted to, pyrrolidinyl, pyrazolinyl, 
pyrazolidinyl, imidazolinyl, imidazolidinyi, piperidinyl, piperazmyl, oxazolidinyl, 
isoxazolidinyl, morpholinyl, thiazolidinyl, isothiazolidinyl, and tetrahydrofuryl. In certain 
embodiments, a "substituted heterocycloalkyl or heterocycle" group is utilized and as used 
herein, refers to a heterocycloalkyl or heterocycle group, as defined above, substituted by the 
independent replacement of one, two or three of the hydrogen atoms thereon with but are not 
limited to aliphatic; heteroaliphatic; aryl; heteroaryl; alkylaryl; alkylheteroaryl; alkoxy; aryloxy; 
heteroalkoxy; heteroaryloxy; alkylthio; arylthio; heteroalkylthio; heteroarylthio; F; CI; Br; I; - 
OH; -NO2; -CN; -CF3; -CH2CF3; -CHCb; -CH2OH; -CH2CH2OH; -CH2NH2; -CH2SO2CH3; - 
C(0)Rx; -C02(R«); -CON(Rx)2; -OC(0)Rx; -OCO2RX; -OCON(Rx)2; -N(Rx)2; -S(0)2Rx; - 
NRx(CO)Rx wherein each occurrence of Rx independently includes, but is not limited to, 
aliphatic, heteroaliphatic, aryl, heteroaryl, alkylaryl, or alkylheteroaryl, wherein any of the 
aliphatic, heteroaliphatic, alkylaryl, or alkylheteroaryl substituents described above and herein 
may be substituted or unsubstituted, branched or unbranched, cycUc or acyclic, and wherein any 
of the aryl or heteroaryl substitutents described above and herein may be substiUited or 
unsubstituted. Additional examples or generally applicable substituents are illustrated by the 
specific embodiments shown in the Examples, which are described herein. 
[0030] As used herein, the terms "aliphatic", "heteroaliphatic", "alkyl", "alkenyl", 

"alkynyl", "heteroalkyl", "heteroalkenyl", "heteroalkynyl", and the like encompass substituted 
and unsubstituted, saturated and unsaturated, and linear and branched groups. Similarly, the 
terms "alicyclic", "heteroalicyclic", "heterocycloalkyl", "heterocycle" and the like encompass 
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substituted and unsubstituted, and saturated and unsaturated groups. Additionally, the terms 
"cycloalkyl", "cycloalkenyl", "cycloalkynyr, "heterocycloalkyl", "heterocycloalkenyl", 
"heterocycloalkynyl", "aryl", "heteroaryr and the like encompass both substituted and 
unsubstituted groups. 

[0031] It will be appreciated that additional examples of generally applicable 

substitutents are illustrated by the specific embodiments shown in the Examples which are 
described herein, but are not limited to these Examples. 

[0032] By the term **protectmg group'*, has used herein, it is meant that a particular 

functional moiety, e.g., O, S. or N, is temporarily blocked so that a reaction can be carried out 
selectively at another reactive site in a multifunctional compound. In preferred embodunents, a 
protecting group reacts selectively in good yield to give a protected substrate that is stable to the 
projected reactions; tiie protecting group must be selectively removed m good yield by readily 
available, preferably nontoxic reagents that do not attack tiie otiier functional groups; the 
protecting group forms an easily separable derivative (more preferably witiiout tiie generation of 
new steteogenic centers); and tiie protecting group has a minimum of additional functionality to 
avoid fiirtiier sites of reaction. As detailed herem, oxygen, sulfur, nitrogen and carbon protecting 
groups may be utilized. For example, in certain embodiments, as detailed herein, certain 
exemplary oxygen protecting groups are utilized. These oxygen protecting groups include, but 
are not limited to metiiyl etiiers, substituted metiiyl etiiers (e.g., MOM (metiioxymetiiyl etiier), 
MTM (metiiyltiiiometiiyl etiier), BOM (benzyloxymetiiyl etiier), PMBM or MPM (p- 
metiioxybenzyloxymetiiyl etiier), to name a few), substitirted etiiyl etiiers, substitiited benzyl 
etiiers, silyl etiiers TMS (trimetiiylsilyl etiier), TES (trietiiylsilyletiier), TIPS 

(triisopropylsilyl etiier), TBDMS (t-butyldimetiiylsilyl etiier), tiibenzyl silyl etiier, TBDPS (t- 
butyldiphenyl silyl etiier), to name a few), esters (e.g., formate, acetate, benzoate (Bz), 
trifluoroacetate, dichloroacetate, to name a few), carbonates, cyclic acetals and ketals. In certain 
otiier exemplary embodiments, nitrogen protecting groups are utilized. These nitrogen 
protecting groups include, but are not limited to, carbamates (including metiiyl, etiiyl and 
substituted etiiyl caifoamates (e.g., Troc), to name a few) amides, cyclic imide derivatives, N- 
Alkyl and N-Aryl amines, imine derivatives, and enamine derivatives, to name a few. Certain 
otiier exemplary protecting groups are detailed herein, however, it will be appreciated tiiat tiie 
present invention is not intended to be limited to tiiese protecting groups; ratiier, a variety of 
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additional equivalent protecting groups can be readily identified using the above criteria and 
utilized in the present invention. Additionally, a variety of protecting groups are described in 
"Protective Groups in Organic Synthesis" Third Ed. Greene, T.W. and Wuts, P.G., Eds.. John 
Wiley & Sons, New York: 1999, the entire contents of which are hereby incorporated by 
reference. 

[0033] The term "natural amino acid side cham" as used herein refers to the side chain of 

any one of the common, naturally occurring L-amino acids found in naturally occurring proteins: 
glycine (Gly), alanine (Ala), valine (Val), leucine (Leu), isoleucine (He), lysine (Lys). arginine 
(Arg). histidine (His), proUne (Pro), serine (Ser), threonine (Thr). phenylalanine (Phe), tyrosine 
(Tyr), tryptophan (Trp), aspartic acid (Asp), glutamic acid (Glu), asparagme (Asn), glutamine 
(Gin), cysteine (Cys) and methionine (Met). 

[00341 The term "unnatural ammo acid side chain" as used herein refers to the side chain 

of all amino acids which are not natural amino acids. This includes, for example, a-, P-, D-, L- 


amino acid residues, and compounds of the general formula o wherein the side 

chain R is other than the amino acid side chains occurring in nature. 

[0035] More generally, the term "amino acid side chain", as used herein, encompasses 

natural amino acid and unnatural amino acid side chains. 

[0036] As used herein, the term "pharmaceutically useful group or entity" refers to a 

compound or fragment thereof, or an organic moiety which, when covalently attached to a 
peptide or protein, can exert some biological or diagnostic function or activity when 
administered to a subject, or enhance the therapeutic, diagnostic or preventive properties of the 
parent peptide and/or protein in biomedical applications, or improve safety, alter biodegradation 
or excretion, or is detectable. Examples of suitable pharmaceutically useful groups or entities 
include hydrophilicity/hydrophobicity modifiers, phannacokmetic modifiers, biologically active 
modifiers, detectable modifiers. A modifier can have one or more pharmaceutical fimctions, e.g., 
biological activity and/or pharmacokinetics modification. Pharmacokinetics modifiers can 
include, for example, antibodies, antigens, receptor ligands, hydrophiUc. hydrophobic or charged 
groups. Biologically active modifiers include, for example, therapeutic drugs and prodrugs, 
antigens, immunomodulators. Detectable modifiers include diagnostic labels, such as 
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radioactive, fluorescent, paramagnetic, superparamagnetic, ferromagnetic. X-ray modulating, X- 
ray-opaque, ultrosound-reflective, and other substances detectable by one of available clinical or 
laboratory methods, e.g., scintigraphy, NMR spectroscopy, MRI, X-ray tomography, 
sonotomography, photoimaging, radioimmunoassay. Modifiers can be small molecules or 
macromolecules, and can belong to any chemical or pharmaceutical class, e.g., nucleotides, 
chemotherapeutic agents, antibacterial agents, antiviral agents, immunomodulators, hormones or 
analogs thereof, enzymes, inhibitors, alkaloids and therapeutic radionuclides. Viral and non-viral 
gene vectors are considered to be a pharmaceutically useful entity or group. 
10037] The term "biomolecules", as used herein, refers to molecules {e.g., proteins, 

amino acids, peptides, polynucleotides, nucleotides, carbohydrates, sugars, Upids, 
nucleoproteins, glycoproteins, lipoproteins, steroids, etc.) which belong to classes of chemical 
compounds, whether naturally-occurring or artificially created {e.g., by synthetic or recombinant 
methods), that are commonly found m cells and tissues. Specific classes of biomolecules 
include, but are not limited to, enzymes, receptors, neurotransmitters, hormones, cytokines, cell 
response modifiers such as growth factors and chemotactic factors, antibodies, vaccines, haptens, 
toxins, interferons, ribozymes, anti-sense agents, plasmids, DNA, and RNA. 
[0038] As used herein, the term "small molecule" refers to molecules, whether naturally- 

occurring or artificially created {e.g., via chemical synthesis) that have a relatively low molecular 
weight. Preferred small molecules are biologically active in that they produce a local or systemic 
effect in animals, preferably mammals, more preferably humans. Typically, small molecules 
have a molecular weight of less than about 1500 g/mol. In certain preferred embodiments, the 
small molecule is a drug. Preferably, though not necessarily, the drug is one that has aheady 
been deemed safe and effective for use by the appropriate governmental agency or body. For- 
example, drugs for human use Usted by the FDA under 21 C.F.R. §§ 330.5, 331 through 361, and 
440 through 460; drugs for veterinary use listed by the FDA under 21 C.F.R. §§ 500 through 
589, incorporated herein by reference, are all considered suitable for use with the present 
hydrophilic polymers. 

[0039] Classes of small molecule drugs that can be used in the practice of the present 

invention include, but are not limited to, vitamins, anti-ATOS substances, anti-cancer substances, 
antibiotics, immunosuppressants, anti-viral substances, enzyme inhibitors, neurotoxins, opioids, 
hypnotics, anti-histamines, lubricants, tranquilizers, anti-convulsants, muscle relaxants and 
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anti-Parkinson substances, anti-spasmodics and muscle contractants including channel blockers, 
miotics and anti-cholinergics, anti-glaucoma compounds, anti-parasite and/or anti-protozoal 
compounds, modulators of cell-extracelliilar matrix interactions including cell growth inhibitors 
and anti-adhesion molecules, vasodilating agents, inhibitors of DNA, RNA or protein synthesis, 
anti-hypertensives, analgesics, anti-pyretics, steroidal and non-steroidal anti-inflammatory 
agents, anti-angiogenic factors, anti-secretory factors, anticoagulants and/or antithrombotic 
agents, local anesthetics, ophthahnics, prostaglandins, anti-depressants, anti-psychotic 
substances, anti-emetics, imaging agents . Many large molecules are also drugs. 
[0040] . A more complete, although not exhaustive, listing of classes and specific drugs 
suitable for practicing the present invention may be found in "Pharmaceutical Substances: 
Syntheses, Patents, Applications" by Axel Kleemann and Jurgen Engel, Thieme Medical 
Publishing, 1999 and the "Merck Index: An Encyclopedia of Chemicals, Drugs, and 
Biologicals", Edited by Susan Budavari et al, CRC Press, 1996, both of which are incorporated 
herein by reference. 

[0041] As used herein, the term "macromolecules" refers to molecules, whether 

naturally-occurring or artificially created (e:g., via chemical synthesis) that have a relatively high 
molecular weight, e.g., generally above 1500 g^mole. Preferred macromolecules are biologically 
active in that they exert a biological function in animals, preferably mammals, more preferably 
humans. Examples of macromolecules include proteins, enzymes, growth factors, cytokines, 
peptides, polypeptides, polylysine, proteins, lipids, polyelectrolytes, immimoglobulins, DNA, 
RNA, ribozymes, plasmids, and lectins. For the purpose of this invention, supramolecular 
constructs such as viruses and protein associates {e.g., dimers) are considered to be 
macromolecules. When covalently attached to a peptide or protein, a macromolecule may be 
chemically modified prior to being covalently attached to said peptide or protein. 
[0042] As used herein, the term "diagnostic label" refers to an atom, group of atoms, 

moiety or functional group, a nanocrystal, or other discrete element of a composition of matter, 
that can be detected in vivo or ex vivo using analytical methods known in the art. When 
covalently attached to a peptide or protein, such diagnostic labels permit the monitoring of the 
peptide or protein in vivo. On the other hand, constructs and compositions that include diagnostic 
labels can be used to monitor biological functions or structures. Examples of diagnostic labels 
include, without limitations, labels that can be used in medical diagnostic procedures, such as. 
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radiopharmaceutical or radioactive isotopes for gamma scintigraphy and Positron Emission 
Tomography (PET), contrast agent for Magnetic Resonance Imaging (MRI) (for example 
paramagnetic atoms and superparamagnetic nanocrystals), contrast agent for computed 
tomography, contrast agent for X-ray imaging method, agent for ultrasound diagnostic method, 
agent for neutron activation, and moiety which can reflect, scatter or affect X-rays, ultrasounds, 
radiowaves and microwaves, fluorophores in various optical procedures, etc. 

Brief Description of the Drawing 
[0043] Figure 1 depicts examples of bifunctional glycopeptides (i.e., diglycosylated 

peptides) containing O-linkages preparaed by the method of the invention. 
[0044] Figure 2 depicts anomeric p-glycosylamines (22 and 23) that differ at a single 

stereocenter (asterisks), and a bifunctional glycopeptide (24) containing one of these structurally 
complex glycaris. 

Detailed Descrifhon of certain preferred embodiments of the Invention 
[0045] In one aspect, the present invention provides novel methodologies for the 

synthesis of peptides and/or proteins fimctionalized at two or more non-adjacent amino acid sites 
{e.g., bearing a pharmaceutically useful group or entity covalently attached to at least two non- 
adjacent ammo acid residues of the polypeptide/protein chain). In certain embodiments, the 
invention provides a system for the synthesis of polypeptides and proteins bearing at least two 
glycans covalently attached to non-adjacent amino acid residues of the polypeptide/protein chain. 
In certain embodiments, in the context of synthetic studies developed for the total synthesis of 
glycosylated fragments of erythropoietin, generalized methodologies were developed for the 
improved synthesis of polyglycosylated peptides and proteins. This general synthetic method 
encompasses the realization that native chemical ligation (NCL) is a glycan-compatible process 
that may be used for assembling large multiply glycosylated polypeptides from glycopeptide 
building blocks. In yet another aspect, the present invention also provides the recognition that 
chemoselective reaction of a glycopeptide containing a C-tenninal cysteine with a glycopeptide 
thioester may be achieved to from the corresponding diglycosylated peptide adduct. 
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[0046] Specific examples, particularly with respect to the synthesis of bifimctional 

glycopeptides (/.e., diglycosylated peptides), are described in more detail below and in the 
Exemplification herein, along with certain general methodologies developed during the course of 
these syntheses. It will be appreciated by one of ordinary skill in the art that these examples are 
not intended to be limiting; rather all equivalents are intended to be incorporated into the scope 
of the present invention. In particular, the inventive method may be generally adapted to the 
preparation of polyfiinctionalized peptides and proteins. 
[0047] 1) Inventive method 

[0048] In one aspect of the present invention, there is provided a method for preparing a 

polyfimctionalized peptide comprising a peptidic backbone made up of four or more amino acids 
wherein two or more non-adjacent amino acids are independently substituted with a moiety 
having the structure: 

wherein each occurrence of l' is independently a substituted or unsubstituted, linear or 
branched, cyclic or acyclic, saturated or unsaturated aliphatic or heteroaliphatic moiety; 

each occurrence of A is independently an aliphatic, heteroaliphatic, aryl, heteroaryl or a 
pharmaceutically usefiil group or entity; 

with the proviso that the peptide sequence between any two consecutive, non-adjacent, 
amino acids bearing a A-L*- moiety comprises at least one cysteine residue. 
[0049] In certain embodiments, each occurrence of A is independently a 

pharmaceutically usefiil group or entity. In certain embodiments, each occurrence of A is 
independently a biomolecule, a small molecule, a macromolecule or a diagnostic label. 
[0050] In certain exemplary embodiments, each occurrence of A is independently a 

carbohydrate determinant having the structure: 

- R3 
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wherein a, b, c, d, e, f. g, h, i, x, y and z are independently 0, 1, 2 or 3, with the proviso 
that the x, y and z bracketed structures represent fiiranose or pyranose moieties and the sum of b 
and c is 1 or 2, the sum of d and f is 1 or 2, and the sum of g and i is 1 or 2, and with the proviso 
that X, y and z are not simultaneously 0; wherein Ro is hydrogen, a linear or branched chain 
alkyl, acyl, arylalkyl or aryl group; wherein each occurrence of Ri, R2, R3, R4, Rs, R« , R7, R« and 
R, is independently hydrogen, OH, OR', NHR', NHCOR*. F, CH2OH, CH2OR', a substituted or 
unsubstituted linear or branched chain alkyl, (mono-, di- or tri)hydroxyalkyl, (mono-, di- or 
tri)acyloxyalkyl, arylalkyl or aryl group; wherein each occurrence of R' is independently 
hydrogen, CHO, COOR", or a substituted or unsubstituted linear or branched chain alkyl, acyl, 
arylalkyl or aryl group or a saccharide moiety having the structure: 


R12 


o 


Rt3 


wherein Y and Z are independenUy NH or O; wherein k, 1, r, s, t, u, v and w are each 
independently 0, 1 or 2; with the proviso that the v and w bracketed structures represent fiiranose 
or pyranose moieties and the sum of 1 and k is 1 or 2, and the sum of s and u is 1 or 2, and with 
the proviso that v and w are not simultaneously 0; wherein R'o is hydrogen, a linear or branched 
chain alkyl, acyl, arylalkyl or aryl group; wherein each occurrence of Rio, Rii, R12, Ri3, Ri4 and 
R,5 is independently hydrogen, OH, OR'", NHR™. NHCOR'". F, CH2OH, CHjOR^ or a 
substituted or unsubstituted linear or branched chain alkyl, (mono-, di- or tri)hydroxyalkyl, 
(mono-, di- or tri)acyloxyalkyl, arylalkyl or aryl group; wherein each occurrence of R16 is 
hydrogen, COOH, COOR", CONHR", a substituted or unsubstituted linear or branched chain 
alkyl or aryl group; wherein each occurrence of R"' is hydrogen, CHO, C00R'\ or a substituted 
or unsubstituted linear or branched chain alkyl, acyl, arylalkyl or aryl group; and wherein each 
occurrence of R" and R*" are each independently H, or a substituted or unsubstituted linear or 
branched chain alkyl, arylalkyl or aryl group. 

[0051] In certain embodiments, each occurrence of L' is independently -O^CHzV. 

wherein n is 0-9, or a glycoside-containing moiety (e.g mono- or poly-saccharide). 
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10052] In certain embodiments, L' is -0-(CH2)„-CH2- and two or more non-adjacent 

amino acids is/are independently substituted with a moiety having the structure: 

wherein each occurrence of n is independently 0-8; and A is as defined above. In certain 
other embodiments, A is a carbohydrate moiety and each occurrence of the n-alkyl glycosidic 
moiety is independently either a- or p-linked to an amino acid residue of the backbone. It will 
be appreciated that polyfunctionalized peptides made accordmg to the method of the present 
invention are not limited to those where each occurrence of comprises n-alkyl where n is 
greater than or equal to 1; rather each occurrence of A can be independently linked via the 
traditional direct linkage (n=0). via n-alkyl (such as pentyl), via a mono-saccharide moiety or 
any combination thereof. In preferred embodiments, each occurrence of A is independently 
selected fiom the group consisting of Globo-H, fucosyl GMl, KH-1. glycophorin. STN. (2.3)ST, 
Le^, Le^ N3, Tn, 2,6-STn, Gb3 and TF. 

[00531 In certain embodiments, occurrences of A may be the same or different. 

[00541 In certain other embodiments, certain occurrences of A are clustered. For 

example, in certain embodiments, the polyfunctionalized peptide comprises at least two adjacent 
functionalized amino acids (/.e., clustered functionalized amino acids), separated from another 
functionalized site by a peptide amino acid sequence comprising at least one cysteine residue. 
[0055] In certain other embodiments, certain occurrences of A are clustered glycosides. 

For example, in certain embodhnents. the multiglycosylated peptide comprises at least two 
adjacent glycosylated amino acids (i.e., clustered glycosylated amino acids), separated from 
another glycosylated site by a peptide amino acid sequence comprising at least one cysteme 
residue. 

[0056] In certain embodiments, the present invention provides a method for preparing a 

polyfunctionalized peptide having the structure: 
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si and s2 are independently an integer from 1 to about 20; 
tl, t2 and t3 are each independently an integer; 

r'^* is hydrogen, alkyl, acyl, aryl, heteroaryl, -alkyl(aryl), -alkyl(heteroaryl), a nitrogen 
protecting group, an amino acid or a proctected amino acid; 

r'" is -OR'^ or -NR'"'*R'°', wherein R'"* is hydrogen, alkyl. aryl, heteroaryl, - 
alkyl(aryl), -alkyl(heteroaryl), a carboxylic acid protecting group, an amino acid or a proctected 
amino acid; and R'°'* and R'^ are independently hydrogen, alkyl, aryl, heteroaryl, -alkyl(aryl), - 
alkyl(heteroaryl), a nitrogen protecting group, an ammo acid or a proctected amino acid; 

r", R'^ and R" are independently H, alkyl, heteroalkyl, aryl, heteroaryl, -alkyl(aryl), - 
alkyl(heteroaryl), or a natural or non-natural amino acid side chain; 

each occurrence of is independently a substituted or unsubstituted aliphatic or 
heteroaliphatic moiety; 

Ai and A2 are each independently an aliphatic, heteroaliphatic, aryl, heteroaryl or a 
pharmaceutically usefiil group or entity; and 

at least one occurrence of the bracketed stracture t2 is a cysteine residue or protected 
cysteine residue; 

wherein the method comprises a step of: 
reacting a peptide acyl donor having the structure: 



tl si t 
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with a peptide amine acceptor having the structure: 



under suitable conditions to eflfect disulfide exchange ligation; 

wherein the sum t+t' equals (t2)+l; R^* is a sulfide protecting group; and K^^ is a group 
such that the moiety -C(=0)SR''° can be made to undergo disulfide exchange with the 
glycopeptide amine acceptor. 

In certain exemplary embodunents, A| and A2 are each independently a carbohydrate 
domain having the structure: 



wherein a, b, c, d, e, f, g, h. i, x, y and z are independently 0, 1. 2 or 3. with the proviso 
that the x, y and z bracketed structures represent fiiranose or pyranose moieties and the sum of b 
andcis 1 or 2, the sum of d and f is 1 or 2, and the sum of g and i is 1 or 2, and with the proviso 
that x, y and z are not simultaneously 0; wherein Ro is hydrogen, a linear or branched chain 
alkyl, acyl, arylalkyl or aryl group; wherein each occurrence of Ri, R2, R3, R4, Rs, Re , R?, R« and 
R9 is independently hydrogen. OH, OR\ NHR'. NHCOR', F, CH2OH, CH2OR', a substituted or 
unsubstituted linear or branched chain alkyl. (mono-, di- or tri)hydroxyalkyl, (mono-, di- or 
tri)acyloxyalkyl, arylalkyl or aryl group; wherein each occurrence of R' is independently 
hydrogen, CHO, COOR", or a substituted or unsubstituted linear or branched chain alkyl, acyl, 
arylalkyl or aryl group or a saccharide moiety having the structure: 
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Rl5 



Rl3 


Rl6 


[00571 wherein Y and Z are independently NH or O; wherein k, 1. r, s, t, u, v and w are 

each independently 0. 1 or 2; with the proviso that the v and w bracketed structures represent 
fiiranose or pyranose moieties and the sum of 1 and k is 1 or 2, and the sum of s and u is 1 or 2, 
and with the proviso that v and w are not simultaneously 0; wherein R'o is hydrogen, a linear or 
branched chain alkyl, acyl, arylalkyl or aryl group; wherein each occurrence of Rio, Rii, R12, R13, 
R,4 and R,5 is independently hydrogen. OH, 0R«', NHR«', NHCOR«', F. CH2OH, CH^OR"', or a 
substituted or unsubstituted linear or branched chain alkyl, (mono-, di- or tri)hydroxyalkyl, 
(mono-, di- or tri)acyloxyalkyl, arylalkyl or aryl group; wherein each occurrence of R16 is 
hydrogen, COOH, COOR", CONHR", a substituted or unsubstituted linear or branched chain 
alkyl or aryl group; wherein each occurrence of R'" is hydrogen, CHO. COO^\ or a substituted 
or unsubstittited linear or branched chain alkyl, acyl, arylalkyl or aryl group; and wherein each 
occurrence of R" and are each independentiy H, or a substituted or unsubstituted linear or 
branched chain alkyl, arylalkyl or aryl group; and wherein each glycosidic moiety is eitiier a- or 
p-linked to an amino acid. 

(00581 In certain embodiments, tiie step may be carried out once, or repeated a desired 

number of times, to prepare a polyftmctionalized peptide having the structure: 


0 


OJ 


rX2 


wherein R'^' and R''^ are as defined above; 
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each occurrence of A may be the same or different and may be as defined for Ai and A2 

above; 

each occurrence of R^* may be the same or different and may be as defined for R^* and 
above; 

q is an integer greater than or equal to 2; 

each occurrence of s is independently an integer firom 1 to about 20; 
each occurrence of t is independently an integer; 
to is an integer; and 

each occurrence of R*^ is independently H, alkyl, heteroalkyl, aryl, heteroaryl, - 
alicyl(aryl), -alkyl(heteroaryl), or a natural or non-natural amino acid side chain. 
[00591 In certain embodiments, q is an integer between 2 and about 5. In certain 

embodunents, q is an integer between 2 and about 10. In certain embodiments, q is an mteger 
between 2 and about 15. In certam embodiments, q is an integer between 2 and about 20. In 
certain embodiments, q is an integer between 2 and about 25. In certain embodiments, q is an 
integer between 2 and about 30. In certain embodiments, q is an integer greater than 30. In 
certain embodiments, q is 2. 

[0060] In certam embodiments, the sum s+t is between about 2 and about 6. In certain 

embodiments, the sum s+t is between about 2 and about 10. In certain embodiments, the sum s+t 
is between about 2 and about 15. In certain embodiments, the sum s+t is between about 2 and 
about 20. In certain embodiments, the sum s+t is between about 2 and about 50. In certain 
embodiments, the sum s+t is between about 2 and about 100. In certain embodiments, the sum 
s+t is between about 2 and about 150. In certain embodiments, the sum s+t is between about 2 
and about 200. In certain embodiments, the sum s+t may be greater than 200. 
[0061] In certam embodiments, tO is an integer firom 0 to about 2. In certain 

embodiments, tO is an integer from 0 to about 5. hi certain embodiments, tO is an integer fix)m 0 
to about 10. In certain embodunents, tO is an integer from 0 to about 15. In certain embodiments, 
to is an integer from 0 to about 20. In certain embodiments, tO is an integer from 0 to about 25. In 
certain embodiments, tO is an integer from 0 to about 30. hi certain embodiments, tO is an integer 
from 0 to about 50. hi certain embodunents, tO is an mteger from 0 to about 100. In certam 
embodiments, tO is an mteger from 0 to about 150. hi certain embodiments, tO is an integer from 
0 to about 200. hi certam embodunents, tO is an integer greater than 200. 
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[0062] 
[0063] 
[0064] 
[0065] 
[0066] 


In certain other embodiments, R'^' is hydrogen, Fmoc or Ac. 
In certain other embodiments, R'" is NH2. 
In certain other embodiments, R''" is -CH2CH2S03Na. 
In certain other embodiments, R^' is -S/Bu. 

In certain exemplary embodiments, the peptide acyl donor having the structure: 

0 



is prepared by reacting R^'^^S-Na with a compound having the structure: 

0 


-ORXOa 


wherein R''"* is a sulfur-substituted aryl moiety. 
[0067] In certain exemplary embodiments, R'^"* has the following structure: 


[0068] In certain exemplary embodiments, in the peptide acyl donor having the structure: 
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the amino acyl residue directly attached to is phenylalanine. 

[00691 In certain embodiments, when at least one occurrence of A (or Ai and/or A2, as 

further defined for A) is a carbohydrate domain, some or all of carbohydrate domains are O- 
linked to the peptide backbone. In certain other embodiments, when at least one occurrence of A 
(or Ai and/or A2, as further defined for A) is a carbohydrate domain, some or all of carbohydrate 
domains are N-linked to the peptide backbone. In certain embodiments, the peptide sequence 
between each point of attachment of the A moieties comprises a cysteine residue. In certain 
embodiments, the polyfimctionaUzed construct (i.e., constnict bearing more than one A moiety) 
is prepared by Native Chemical Ligation. In certain embodiments, the polyfimctionaUzed 
peptides obtained by the inventive method are symmetrical (fimctionalized peptide building 
blocks to be ligated have the same peptide sequence (safe for N- and/or C-termmals) and bear the 
same fimctionalization pattem), nonsymmetrical (fimctionalized peptide building blocks to be 
Ugated differ in peptide sequence (excluding N- and C-terminals) and/or fimctionalization 
pattem). In certain embodiments, the inventive method allows the preparation of 
multiglycosylated peptides designed to approximate the spatial position(s) of carbohydrate(s) in 
glycoprotein/glycopeptides of interest (e.g., naturally occurring glycoproteins such as gpl20 and 
erythropoietin). 

[0070] In certain embodiments, the method fiirther comprises a step of conjugating the 

polyfimctionalized peptide to an immunogenic carrier. In certain exemplary embodiments, the 
carrier is a protein, a peptide or a lipid. In certain other exemplary embodiments, the carrier is 
Bovine Serum Albumin (BSA), Keyhole Limpet Hemocyanin (KLH) or polylysine. In certain 
other embodiments, the carrier is is a lipid carrier having the structure: 
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wherein m, n and p are each independently integers between about 8 and 20; and Rv is 
hydrogen, substituted or unsubstituted linear or branched chain lower alkyl or substituted or 
unsubstituted phenyl. In certain exemplary embodiments, m', n' and p' are each 14 and the lipid 
is tripalmitoyl-S-glycerylcysteinylserine (e.g., PamCys). 

[0071] It will be appreciated that the carrier can be linked to the polyfunctionalized 

peptide either directly or through a crosslinker, and thus the peptide may be attached to a protein, 
peptids, or lipid, as well as a (crosslinker-protein), (crosslinker-peptide) and (crosslinker-Upid) 
moiety. 

[0072] Crosslinkers suited to the invention are widely known in the art (see, for example, 

1994 Pierce Technical Handbook: cross-linking (Appendix A), which is available at 
http://www.piercenet.com/ resources/browse.cfin?fldID=184), including bromoacetic NHS ester, 
6-(iodoacetamido)caproic acid NHS ester, maleimidoacetic acid NHS ester, maleimidobenzoic 
acide NHS ester, etc. In certain preferred embodiments, the crosslinker is MMCCH (4- 
(maleimidomethyl) cyclohexane-l-carboxyl hydrazide). In certain other preferred embodiments, 
the crosslinker is MBS (m-maleimidobenzoyl acid N-Hydroxysuccinimidyl ester). In certain 
embodiments, the crosslinker is a fiagment having the structure: 



whereby said structure is generated upon conjugation of a maleimidobenzoic acid N- 
hydroxy succinimide ester wiA a suitable functionality on the polyfunctionalized peptide. 
[0073] 2) Synthetic Methodology 

[0074] Native Chemical Ligation 
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[0075] One of the more widely used methods for the synthesis of glycopeptides is native 

chemical ligation (NCL)/* First reported by Kent in 1994, NCL allows for the assembly of large 
proteins with native amide bonds from unprotected peptide building blocks (Scheme 1). 
Furthermore, the reaction is mild, selective, and compatible v^ath the presence of glycans. When 
glycans are present in the reaction, they are typically found on the C-terminal side. In the event, 
a glycopeptide containing a C-terminal cysteine undergoes a chemoselective reaction with a 
peptide thioester. The resultmg peptide thioester then rearranges spontaneously to furnish a 
native peptide bond, effectively lengthenmg the peptide backbone of the glycopeptide. 
[0076] Scheme 1. Exemplary Native Chemical Ligation methodology 

rnsma chemoselective »2**^.^^^-^^S^m 


CGlYcag 

" O 


[00771 As discussed above, glycopeptides bearing multiple glycosidic domains are of 

particular interest Such structures include the synthesis of gpl20 fragments, erythropoietin 
(EPO), human chorionic gonadotropin (HCG), and amyloid precursor protein (APP). Drawing 
from previous studies in the area,*' NCL was explored as a way to couple two small 
glycopeptides to form one large glycopeptide. One drawback that became apparent lies in the 
difficulty with which unprotected glycopeptide a-thioesters are synthesized.'" Accordingly, a 
new method for the synthesis of glycopeptide a-thioesters. or their equivalent, was devised, to 
allow the assembly of glycopeptides bearing multiple glycosidic domains. 
10078) A closer look at the reaction mechanism of NCL provided clues as to how this 

problem might be approached. In one embodiment of the reaction itself, the sulfhydryl group of 
the cysteine coupling partner, initially protected as a r-butyldisulfide, is released by disulfide 
exchange with 2-mercaptoethanesulfonic acid (MESNa) and then undergoes trans- 
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thioesterification as mentioned above. Without wishing to be bound to any particular theory, it 
was thought that it might be possible to mask the a-thioester as a phenyl ester, bearing an 
ethyldisulfide in the ortho position (Scheme 2) by taking advantage of this disulfide exchange. 
Under the reaction conditions, the free sulfhydryl would be released and undergo a spontaneous 
rearrangement to furnish a thioester capable of entering into the NCL pathway. 
(00791 Scheme 2. A new method for the generation of peptide a-thioesters. 

NCL: 

o o o 


Modification: 


[00801 To test the viability of this proposed modification, the simple dipeptide, PheCys, 

was synthesized. The desired orf/io-substituted phenol was synthesized in two steps from 
commercially available 2-mercaptophenol by oxidation and disulfide exchange. Coupling of the 
phenol to phenylalanine followed by the removal of the Boc protecting group set the stage for 
our first attempt (Scheme 3). The reaction proceeded in good yield. More importantly no 
racemization was observed under the reaction conditions. 
[00811 Schemes. Exemplary NCL with masked thioester 
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a: 


1) Boc-L-Phe-OH. 
1) I2. MeOH, H2O r^'^^V EDC'- PMAP 


2) BFjWEt. (EtS)2 ^<^SSEt 

H2O 



SSEt PH7.5 O --SH 

78% yield 
No observed 
racemization 


[0082] The synthesis of glycopeptides bearing two glycosidic domains was then 

undertaken. Synthesis of the AT-terminal fragment (the fragment that would ultimately become 
the masked thioester) began with the solid phase synthesis of a peptide acid, followed by 
standard peptide coupling to introduce the phenylalanine bearing the phenyl ester. TFA 
deprotection of the side chain protecting groups furnished the peptide backbone for the N- 
terminal fragment (Scheme 4). 

[00831 Scheme 4. Exemplary synthesis of the peptide backbone of the iV^terminal 

fragment. 

papUde synthesis B : h " = H « CH^.DUf 

^NH NH 


°c„ „ ... 
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[0084] Coupling of an oligosaccharide with excess peptide furnished a fully deprotected 

glycopeptide ready for ligation (Scheme 5). 

[0085] Scheme 5. Exemplary approach to the completion of the A^terminal fragment 



[0086] With the iV-terminal portion in hand, the synthesis of the C-terminal coupling 

partner was then carried out. In a similar way, the peptide backbone was prepared by solid phase 
peptide synthesis. Upon cleavage from the resin, glycosylation and Fmoc removal, the C- 
terminal glycopeptide was obtained (Scheme 6). 

[0087] Scheme 6. Exemplary synthesis of the C-terminal glycopeptide 


Page 31 of 56 

Express Mail No.: EV 170393236 US 
3665577V2 


Attorney Docket No.: 2003080-0149 
Client Reference No.: SK-1 IS6-PRO 


[0088] In certain exemplary embodiments, the synthesis of glycopeptides bearing two 

glycosydic domains was accomplished as follows: equal molar amounts of both the iV-terminal 
and C-terminal glycopeptides were combined in a LCMS vial and to this was added a solution of 
MESNa in phosphate buffered saline. The reaction was monitored by LCMS and to our 
amazement, the rearrangement of the phenyl ester to thioester occurred quite rapidly/* 
additionally mass peaks corresponding to the desired product were observed ahnost instantly. 
Upon completion of the reaction, any remaining disulfide bonds were reduced by the action of 
tris(2-carboxyethyl) phosphine hydrochloride (TCEP). The crude reaction mixtiire was then 
subjected to HPLC for purification. The reaction proceeded extremely well, providing doubly 
glycosylated glycopeptides, typically in 60-70% yield. Exemplary di-glycosylated peptides tiiat 
may be prepared by the method of the present invention are detailed in the Exemplification 
below. It should be noted tiiat symmetrical, nonsymmetrical and mixed (AMinked and O-linked) 
glycopeptid^ can be used. 

[0089] Accordingly, there is provided herein a method for preparing glycopeptides 

comprising at least two carbohydate domains covalently attached tiiereto. In certain 
embodiments, some or all of carbohydrate domains are O-linked. In certain otiier embodiments, 
some or all of carbohydrate domains are N-linked. hi certain embodiments, tiie glycopeptide 
comprises two or more carbohydate domains covalentiy attached thereto, wherein the 
glycopeptide sequence between each point of attachment of the carbohydrate domains comprises 
a cysteine residue. In certain embodiments, the mutii-glycan glycopeptide is prepared by Native 
Chemical Ligation. In certain embodiments, the method allows for coupling where each 
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coupling partner is a glycopeptide itself. Symmetrical, nonsymmetrical and mixed (N-linked and 
0-linked) glycopeptides can be obtained. In certain embodiments, the method involves the in 
situ generation of a thioester that is then used immediately in native chemical ligation. 
[0090] Peptide Thioester Synthesis 

[00911 Several methods have been developed for peptide thioester synthesis, including 

the original "Boc chemistry" (Boc = /er/-butyloxycarbonyl) method (See, for example, (1) 
Canne, L. E.; Walker, S. M.; Kent, S. B. H. "A General Method for the Synthesis of Thioester 
Resin Linkers for Use in the Solid-Phase Synthesis of Peptide Alpha-Thioacids." Tetrahedron 
Lett. 1995, 36, 1217-1220; and (2) Hojo, H.; Aimoto, S. "Polypeptide Synthesis Using the S- 
Alkyl Thioester of a Partially Protected Peptide Segment-Synthesis of Ac DNA- Binding 
Domain of C-Myb Protein (142-193)-NH2." Bull Chem. Soc. Jpn. 1991, 64, 111-117) and 
several Fmoc-compatible systems (See, for example, (1) Shin, Y.; Winans, K. A.; Backes, B. J.; 
Kent, S. B. H.; Ellman, J. A.; Bertozzi, C. R. "Fmoc-based synthesis of peptide-(alpha)thioesters: 
Application; to the total chemical synthesis of a glycoprotein by native chemical ligation." J. Am. 
Chem. Soc. -1999, 121, 11684-11689; (2) Ingenito, R.; Bianchi, E.; Fattori, D.; Pessi, A. "Solid 
phase synthesis of peptide C-terminal thioesters by Fmoc/t-Bu chemistry." J. Am. Chem. Soc. 
1999, 121, 11369-11374; (3) Li, X. Q.; Kawakami, T.; Aimoto, S. "Direct preparation of peptide 
thioesters using an Fmoc solidphase method." Tetrahedron Lett. 1998, 39, 8669-8672; (4) 
Clippingdale, A. B.; Barrow, C. J.; Wade, J. D. "Peptide thioester preparation by Fmoc solid 
phase peptide synthesis for use in native chemical ligation." J. Pept. Sci. 2000, 6, 225-234; and 
(5) Bu, X. Z.; Xie, G. Y.; Law, C. W.; Guo, Z. H. "An improved deblocking agent for direct 
Fmoc soUdphase synthesis of peptide thioesters." Tetrahedron Lett. 2002, 43, 2419-2422). In 
ceratin embodiments, the model thioester is a C-terminal glycine thioester, which is locaUy 
achiial and cannot be epimerized, and is therefore easy to synthesize. Though the desired GP120 
thioester contains an epimerization-prone C-terminal histidine (His) residue, such thioesters have 
been synthesized previously and have in fact been shown to modulate favorably the rate of NCL 
(See, for example, Hackeng, T. M.; Griffin, J. H.; Dawson, P. E. "Protem synthesis by native 
chemical ligation: Expanded scope by using straightforward methodology." Proc. Ufatl. Acad 
Sci. U. S A. 1999, 96, 10068-10073). 

[0092] It is understood that, although the discussion above regardmg Native Chemical 

Ligation focuses on the preparation of multiglycosylated peptides, the method may readily be 
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adapted to other polyfunctionalized peptides and/or proteins. For example, peptides and/or 
proteins functionalized at more than one amino acid site with a pharmaceutically useful group or 
entity may be prepared by the method of the invention. 

Equivalents 

[0093] The representative examples which follow are intended to help illustrate the 

invention, and are not intended to, nor should they be construed to, limit the scope of the 
invention. Indeed, various modifications of the invention and many further embodiments 
thereof, in addition to those shown and described herem, will become apparent to those skilled in 
the art ftom the foil contents of this document, including the examples which follow and the 
references to the scientific and patent literature cited herein. In but one illustrative example, 
protecting groups play an important role in the syntiiesis of the carbohydrate domains and 
syntiietic conjugates, as described herein; however it will be appreciated by one of ordinary skill 
in the art that the present invention encompasses tiie use of various alternate protecting groups 
known in the art Those protecting groups used in the disclosure including the Examples below 
are merely illustrative. 

10094] It should forther be appreciated tiiat, uless otherwise indicated, tiie contents of 

tiiose cited references are incorporated herein by reference to help illustrate tiie state of the art. 
Hie following examples contain important additional information, exemplification and guidance 
which can be adapted to tiie practice of tiiis invention in its various embodiments and tiie 
equivalents thereof . 

Exemplification 

[0095] The metiiod of tiiis invention can be understood fiirther by tiie examples tiiat 

illustrate some of tiie processes by which tiie inventive metiiod may be practice. It will be 
appreciated, however, tiiat tiiese examples do not limit tiie invention. Variations of tiie 
invention, now known or fiirther developed, are considered to fall witiiin tiie scope of tiie present 
invention as described herein and as hereinafter claimed. 
[0096] 1) General Descr^tion of Synthetic Methods: 

[0097] The practitioner has a a well-established literature of peptide, protein and 

glycoside chemistry to draw upon, in combination witii tiie infomiation contained herein, for 
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guidance on synthetic strategies, protecting groups, and other materials and methods useful for 
practicing the method of this invention. 

[0098] The various references cited herein provide helpful backgroimd information on 

preparing complex glycosides, glycosylated peptides and other glycosylated constructs that may 
be applied and/or adapted to the method of the present invention. 

[00991 According to the present invention, any available techniques can be used to 

practice the method of the invention. For example, a variety of solution phase synthetic methods 
such as those discussed in detail below may be used. Alternatively or additionally, a variety 
combinatorial techniques, parallel synthesis and/or solid phase synthetic methods known in the 
art may be used. 

[01001 Starting materials and reagents used in practicing the specific embodiments of the 

invention detailed below and herein are either available from commercial suppliers such as 
Aldrich Chemical Company (Milwaukee, WI), Bachem (Torrance, CA), Sigma (St. Louis, MO), 
or are prepared by methods well known to a person of ordinary skill in the art following 
procedures described in such references as Fieser and Fieser 1991, "Reagents for Organic 
Synthesis", vols 1-17, John Wiley and Sons, New York. NY, 1991; Rodd 1989 "Chemistry of 
Carbon Compounds", vols. 1-5 and supps, Elsevier Science Publishers, 1989; "Organic 
Reactions", vols 1-40, John Wiley and Sons, New York, NY, 1991; March 2001, "Advanced 
Organic Chemistry", 5th ed. John Wiley and Sons. New York, NY; and Larock 1990, 
"Comprehensive Organic Transformations: A Guide to Functional Group Preparations", l*"* ed. 
VCH Publishers. These schemes are merely illustrative of some embodiments of the present 
invention, and various modifications to these schemes can be made and will be suggested to a 
person of ordinary skill in the art having regard to this disclosure. 

[01011 In practicing the invention, starting materials, intermediates, and compounds may 

be isolated and purified using conventional techniques, including filtration, distillation, 
crystallization, chromatography, and the like. They may be characterized using conventional 
methods, including physical constants and spectral data. 

(01021 Methods for preparing glycopeptides (e.g. , O- or N-linked glycopeptides) and for 

conjugating peptides and glycopeptides to carriers are known in the art. For example, guidance 
may be found in U.S. Patent No.: 6,660,714; U.S. Patent Application Nos.: 09/641,742; 
10/209,618; and 10/728,041; U.S. Provisional Patent Application Nos.: 60/500,161 and 
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60/500,708; and International Patent Application Nos.: PCT/ US03/38453 and PCT/ 
US03/38471; each of the above-referenced patent documents are hereby incorporated by 
reference herein. 
[0103] Overview 

[01041 Following extensive earlier studies, wherein our chemistry was interfaced with 

important findings from other laboratories,'^ we demonstrated the ability to ligate fully synthetic 
iV-linked glycopeptides containing various complex oligosaccharides to larger polypeptide 
domains." For example, major glycosylated subsections of prostate specific antigen (PSA)^"^'' 
Reference source not found.a gpHO'* could be obtained. Provided herein is an exemplary convergent 
approach to access polypeptides multiply glycosylated at designated sites.'* As discussed 
previously, glycoproteins have potentially important clinical roles in the contexts of vaccines,^ 
diagnostics,^ and tiierapeutics.^ The method of the present invention thus has important 
applications in the field of medicine since it offers access to complex therapeutically usefiil 
glycoproteins. 

[0105] Scheme 7 schematizes an embodiment of the present invention. For example, 

glycopeptides 4 and 7 may be generated in a convergent way from their respective glycan and 
peptide precursors, and may be coupled with full convergence to reach 8. 
[0106] Scheme 7. A convergent synthesis of bifunctional glycopeptides. 


Express Mail No.: EV 170393236 US 
3665577v2 


Page 36 of 56 


Attorney Docket No.: 2003080-0149 
Client Reference No.: SK-USfr-PRO 



(Glycan = complex oIlBOsaccharide j 
[01071 In one embodiment, the synthesis of a suitable glycopeptide acyl donor for 

ligation with 7 was investigated. One approach was based on utilizing buildmg block 4, in the 
form of a thioester, as acyl donor. However, the preparation of a thioester such as 4 by 
convergent means^* appeared to present difficulties. 

[0108] It was in pondering this problem that an attractive possibility presented itself 

(Scheme 8). The synthesis of a phenolic ester equipped with an unsymmetrical disulfide (c/ 10) 
was then investigated. The phenolic ester linkage would in itself probably not manifest 
sufficient reactivity to serve as a viable acylating agent However, equipped as it would be with 
an or//io-disulfide moiety, the phenolic ester could operate as an incipient acyl donor. Thus, 
reduction of the disulfide in 12. as shown, might well set the stage for in situ elaboration into a 
thioester (cf. 13^14) by 0-*S acyl migration. Thioester 14, present in an unfavorable, but 
dynamic equilibrium, could perhaps be interdicted by the machinery appropriate to native 
chemical ligation (NCL),'^ ultimately cuhninating in 8. 

[0109] While the proposal for the intermediacy of a discreet thioester 14 is certainly 

attractive, it is not necessarily an obligatory event en route to ligation. Thus, as suggested in 
structure 13, the presence of a firee orf/ia-benzene thiol fimction might well enhance the acylating 
facility of the phenolic ester," perhaps through in situ hydrogen bonding or anchimeric 
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assistance. In either case, the sequence is set into motion only upon reductive cleavage of the 
disulfide. 

[01101 The synthesis of the masked glycopeptide acyl donor was accomplished as 

follows (Scheme 8). CommerciaUy available 2-mercaptophenol (9) was oxidized to its 
symmetrical disulfide. Subsequent exchange with excess ethyl disulfide** fiimished an 
unsymmetrical, aryl-alkyl disulfide. Acylation of the phenolic hydroxyl group with Boc- 
phenylalanine followed by acidic cleavage of the 3oc protecting group afforded phenylalanine 
derivative 10. Standard peptide coupling of 10 with a fiilly protected peptide acid*' followed by 
global deprotection provided 11,^** which contains a free aspartate residue. Aspartylation of a 
chitobiose-derived glycosylamine with 11 afforded the virtual acyl donor 12, corresponding to 
our target stmcture 4 (Scheme 7). 

[0111] Scheme 8. A structure, synthesis, and proposed mechanism of a masked 

glycopeptide acyl donor." 



fcGlyca^l = chitobiose in 12- 


= FmocRNRSG-in12-15; 


= CHjCHzSOaNa 
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[0112] " Reagents and conditions: (a) I2, MeOH. H2O; (b) BFj-OEtz, EtSSEt, CHjCh, 

99%, 2 steps; (c) Boc-Phe-OH, EDCI, DMAP. CHjClzmiF, 93%; (d) 4N HCl/dioxane, 94%; (e) 
Fmoc-Arg(Pb£)-AspCBu)-Arg(Pbf).Ser(^u)-Gly-OH, HATU. DIEA, DMF, 61%; (f) 
TFA:phenol:Et3SiH:H20 35:2:1:1, 60%; (g) GlcNAcpi->4GlcNAcP-NH2. HATU. DIEA, 
DMSO, 52%. 

[0113] As a preliminary probe of the practicability of this scheme, phenylalanine 

derivative 10 was subjected to the neutral, aqueous, reducing reaction conditions described 
above, along with free cysteine (Scheme 9). The desired free dipeptide Phe-Cys (17) was 
obtained in 78% isolated yield. Only a single product was observed by LCMS and 'H NMR. 
strongly suggesting that epimerization at the a-carbon of phenylalanine did not occur. 
[01141 Scheme 9. An example of the latent acyl donor function. 


Etss ^ ^SH excess ^ ^SH 


pH-7.4 " O 

10 16 78% yield 17 

[01151 The synthesis of bifunctional glycopeptides was then carried out. Thus, 12 was 

treated with excess sodium 2-mercaptoethanesulfonate (MES-Na) at neutral pH in aqueous 
phosphate buffered saline (PBS) in the presence of 18.'^ containing a latent N-terminal cysteine 
residue (Scheme 10). As observed by LCMS during the reaction, 12 was almost instantaneously 
(less than 3 min) converted to the thioester derived from MES-Na (c/ IS); very littie hydrolysis 
to the free carboxylic acid was noted. Over the course of the next several hours, we observed a 
gradual decrease in the amount of this thioester, with a concomitant increase in the desired 
product 19. Only one compound of the appropriate mass was observed by LCMS and *H NMR. 
again indicating a lack of epimerization. 

[01 1 6] Scheme 10. Exemplary convergent coupling of functionalized glycopeptides. 
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F»noc.Arg.Ain.Arg^er^ly-B^oA^ * "^^V^Ala-Ain-Ala-SorJ^Hz 

(excess 
MES-Na. 


Ph^AcHN o aSSj X 

sn^rg-Ser-Gly-N'^'^Y'^AIa-Asn-A 


Fmoc-Arg-Asn^rg-Ser-Gly-N'^'^Y'^AIa-Asn-Ala-Ser.NHa 

**'SH 
19 Phe - Cys 

[01171 With this proof of principle in hand, attention was directed to investigating the 

generaUty of the method. For example, applicability of the method to incorporation of O-linked 
domains^ was established, as shown through the synthesis of 20. Notably, the histidine- 
contaming system 21 was also assembled (for structure of 21 and 20, see Figure 1). Since 
activated C-terminal histidine residues are known to be particularly susceptible to 
epimerization.^ a prophylactic dinitrophenyl (DNP) protecting group^^ was used for the 
imidazole t-nitrogen during the synthesis. As expected, the DNP group was cleaved 
concurrently during the mildly nucleophilic coupling reaction. No indications of epimerization 
were observed by LCMS or 'H NMR. 

[0118] Validation of the highly convergent methodology in the context of complex 

glycans was then investigated. Syntheses of glycosylamines 22'^ and 23^' (Figure 2) have been 
reported previously. It will be noted that these compounds differ in a single stereogenic locus in 
the interior C-ring (see asterisks). Thus 22 corresponds to a fiilly stereochemically competent N- 
linked high mannose oligosaccharide. In contrast, its C-ring C-2 epimer is an unnatural 
stereochemical mutant Studies aimed at evaluating whether high-mannose binding lectins can 
discriminate between point mutants that differ in only one out of a total of 25 stereochemical loci 
are underway. 

[0119] Preparation of pentasaccharide-derived glycopeptide coupling partners proceeded 

via Lansbury aspartylation'* of 22 and 23 with 25 and 26, which afforded 27 and 28, 
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respectively. Coupling of 28 with a relatively simple chitobiose-containing acyl donor gave 24 
(Figure 2) in 77% yield. 

[01201 Scheme 11. Convergent coupling of two functionalized glycopeptides containing 

distinct, structurally complex glycans." 



Cys— Ala— Adn-nAla— Sei^NH2 

29 

[01211 " Reagents and conditions: (a) 25 or 26, HATU, DDEA, DMSO, 50% for 27 (from 

22), 71% for 28 (from 23); (b) 20% piperidine in DMF, 63%; (c) 0.2 M phosphate, 0.2 M NaCl, 
pH -7.4, excess MES-Na, 75%. 

[01221 Coupling of 27 and 28 was accomplished smoothly upon reductive cleavage of 

the disulfide linkage in 27, affording 29 as shown in Scheme 11. Aside from vaUdating the 
methodology in a striking way. the synthesis of 29 serves to pinpoint the power of the total 
synthesis approach: Since one of its carbohydrates is unnatural, 29 cannot readily be obtained 
from natural sources or via enzymatic manipulations. 

(01231 In summary, a convergent method for the synthesis of diglycosylated peptides is 

described herein. The mechanistic rationale set fortii in Scheme 8 is supported by tiie 
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identification of the MES-Na thioester as a reaction intermediate. Furthermore, potential 
hydrolysis and C-terminal epimerization of the glycopeptide acyl donor have been suppressed. It 
seems likely that the method and logic set forth above will enjoy application in the building of 
complex glycopeptides of biological and even medicinal consequence. 

[0124] It will be appreciated that the method can readily be applied to the preparation of 

multi-glycosylated peptides (e.g., comprising more than two non-adjacent glycosides covalently 
linked at designated sites). 
[0125] General Reaction Procedures; 

[0126] Unless mentioned specifically, reaction mixtures were stirred using a 

magnetically driven stirrer bar. Reactions involving air or moisture-sensitive reagents or 
intermediates were performed under argon or nitrogen atmosphere in glassware which had been 
heat gun or flame-dried under high vacuum. An inert atmosphere refers to either dry argon or 
dry nitrogen. Reactions were monitored either by thin layer chromatography, by proton nuclear 
magnetic resonance (NMR) or by high-pressure liquid chromatography (HPLC), of a suitably 
worked up sample of the reaction mixture. 

[0127] Indicated reaction temperatures refer to those of the reaction bath, while room 

temperature (rt) is noted as 22 "C. Preparative reactions were stined magnetically. 
Tetrahydrofuran (THF), diethyl ether (Et20), methylene chloride (CH2CI2), and toluene were 
obtained from a dry solvent system (activated alumina columns, positive pressure of argon). All 
other solvents were used as received in Sure/Seal botties (Aldrich). Triethylamine (EtaN), 
diisopiopylethylamine (i-PriNEt), pyridine, 2,6-lutidine, and chlorotrimethylsUane (TMSCl) 
were distilled from CaRi immediately prior to use. All other reagents were purchased from 
Aldrich at the highest commercial quality and used without further purification, with the 
exception of the Stryker reagent which was purchased from Fluka, the RCM catalysts 16 and 17 
which were purchased from Strem, and biotin-dPEG4-hydrazide which was purchased bom 
Quanta Biodesign. 

[0128] General Wor k Up Procedures; 

[0129] Unless mentioned specifically, reaction mixtures were cooled to room 

temperature or below then quenched, when necessary, with either water or a saturated aqueous 
solution of ammonium chloride. Desired products were extracted by partitioning between water 
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and a suitable water-immiscible solvent {e.g. ethyl acetate, dichloromethane, diethyl ether). The 
desired product containing extracts were washed appropriately with water followed by a 
saturated solution of brine. On occasions where the product containing extract was deemed to 
contain residual oxidants, the cxtraa was washed with a 10% solution of sodium sulphite in 
saturated aqueous sodium bicarbonate solution, prior to the aforementioned washing procedure. 
On occasions where the product containing extract was deemed to contain residual acids, the 
extract was washed with saturated aqueous sodium bicarbonate solution, prior to the 
aforementioned washing procedure (except in those cases where the desired product itself had 
acidic character). On occasions where the product containing extract was deemed to contain 
residual bases, the extract was washed with 10% aqueous citric acid solution, prior to the 
aforementioned washing procedure (except in those cases where the desired product itself had 
basic character). Post washing, the desired product containing extracts were dried over 
anhydrous magnesium sulphate, and then filtered. The crude products were then isolated by 
removal of solvent(s) by rotary evaporation under reduced pressure, at an appropriate 
temperature (generally less than 45''C). 
[0130] General Purification Procedures! 

[01311 Unless mentioned specifically, chromatographic purification refers to flash 

column chromatography on silica, using a single solvent or mixed solvent as eluent. Suitably 
purified desired product containing elutes were combined and concentrated under reduced 
pressure at an appropriate temperature (generally less than 45°C) to constant mass. Final 
compounds were dissolved in 50% aqueous acetonitrile, filtered and transferred to vials, then 
fieeze-dried under high vacuum before submission for biological testing. 
[0132] Analytical Equipment : 

[0133] HPLC: All separations involved a mobile phase of 0.05% TFA (v/v) in water 

(solvent A)/0.0425% TFA in acetonitrile (solvent B). Preparative, semipreparative, and 
analytical HPLC separations were performed using a Rainm HXPL solvent delivery system 
equipped with a Rainin UV-1 detector and one of the following Dynamax-60A CI 8 axial 
compression columns 250 mm in length equipped with a similarly packed guard column: 41.4 
mm diameter (prep), 21.4 m diameter (semiprep), or 4.6 mm diameter (analytical). Separations 
were performed at flow rates of 48 mL/min (prep). 16 mL/min (semiprep), or 1 mL/min 
(analytical), and were monitored at a wavelength between 214 and 230 nm, depending on 
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column loading. LCMS chromatographic separations were performed using a Waters 2695 
Separations Module and a Waters .996 Photodiode Array Detector equipped with a Varian 
Microsorb CIS 2 x 150 mm column at a flow rate of 0.2 mL/min. 

[01341 ESMS and LCMS: Electrospray mass spectroscopy and LCMS analyses were 

obtained on a Waters Micromass ZQ mass spectrometer in conjunction with the Waters HPLC 
apparatus described above. 

[0135] NMR: 'H and "C NMR spectra were recorded on Bruker instruments in CDCI3, 

CD3OD or D2O at 400 or 500 MHz for *H and 100 or 125 MHz for "C. 

[0136] Reagents: All commercial materials were used as received unless otherwise 

noted. The following solvents were obtained from a dry solvent system and used without further 
purification: THF, diethyl ether, toluene, and DCM. Reagents were obtained from Aldrich or as 
noted, with the following exceptions: amino acids and resins for solid phase peptide synthesis 
were purchased from NovaBiochem; Biosynthesis grade DMF from EM Science; and all other 
solvents from Fisher Scientific (HPLC grade). 

[0137] To a stirred, biphasic solution of 2-mercaptophenol (1.0 g, 7.9 

mmol) in H2O (5.1 mL) was added, drop wise, a solution of iodine (1.0 g 4.0 
mmol) in methanol (3.5 mL). When the brown iodine color persisted the 
solution was diluted with ethyl acetate and water. The aqueous layer was removed and extracted 
with an additional portion of ethyl acetate. The combined organic layers were dried and washed 
with brine then dried (Na2S04) and concentrated to give a brown oil which was used without 
purification (1.5 g). The product stiU contains iodine. 'H NMR (CDCI3. 500 MHz): 8 7.33-7.37 
(m, 2H), 7.22-7.24 (m, 2H), 6.99-7.01 (m, 2H), 6.82-6.85 (m, 2H), 6.22 (brs, 2H). ^H NMR 
(CDCI3, 400 MHz): 8 7.35 (dt, J = 8.16, 1.65 Hz, 2H), 7.22 (dd, J = 7.52, 1.65 Hz, 2H), 7.00 
(dd, J = 8.16, 1.15 Hz, 2H), 6.83 (dt, 

J= 7.52, 1.15 Hz, 2H). 6.22 (s, 2H) "C NMR (CDCI3, 125 MHz): 8 157.3, 136.6, 133.6, 121.4, 
120.3,116.1. ESI-MS:Calcd.forCi2Hio02S2[M+NH4]* 267.8 Found: 267.8. 

[0138] To a stirred solution of the disulfide (1.5 g crude, '-4 mmol) in 

CXs'®^ CH2CI2 (25 mL) was added ethyldisulfide (10.5 mL, 79.9 mmol) and then 
BF3»OEt2 (10.1 mL, 79.9 mmol). The reaction was stirred at room temperature for three hours 
and then carefully quenched by the addition of (aq.) NaHCOs. The organic layer was drained 
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and the aqueous layer was extracted with an additional portion of CH2CI2. The combined 
organic layers were dried (MgS04) and concentrated to give a yellow oil. Purification by siUca 
gel chromatography (20% ethyl acetate in hexane) gave the desired product as a clear, slighUy 
yellow oil (1.45 g, 99%). 'H NMR (CDCI3, 500 MHz): 5 7.48-7.51 (m, IH), 7.28-7.32 (m, IH), 
6.99-7.01 (m, IH), 6.86-6.9 (m, IH), 6.34 (brs, IH), 2.78 (q.7= 7.4 Hz. 2H), 1.35 (t.y= 7.4 Hz, 
3 H). '^C NMR (CDCI3, 125 MHz): 6 156.9, 135.2, 132.2, 121.0. 116.2, 32.4, 14.1. ESI-MS: 
Calcd. for C8H10OS2 [M+Na]* 208.8 Found: 208.8. 

[0139] To a solution of the phenol (1.45 g, 8 mmol) and Boc-Phe-OH 

(2.65 g, 10 mmol) in CH2CI2 (25 mL) and THF (5 mL) was added EDCI 
(1.92 g, 10 mmol) and DMA? (98 mg, 0.8 mmol). The resulting solution was 
° 1^1^ stirred at room temperature for 18 hr at which point the volatile materials were 
removed in vacuo. The resulting oil was taken up in EtOAc and washed with IN HCl, H2O, and 
then brine. The organic layer was dried (Na2S04) and concentrated to give a slightly yellow oil. 
Purification by silica gel chromatography (30% ethyl acetate in hexane) gave a clear, coloriess 
oil (3.5 g, >99%). *H NMR (CDCI3. 400 MHz): 6 7.78-7.81 (m IH), 7.23-7.36 (m. 7H), 7.01- 
7.03 (m. IH), 5.02 (d, J = 8.16 Hz, IH), 4.86-4.91 (m, IH), 3.31 (abx, 
J^t = 13.98 Hz, 7« = 5.41 Hz. Ac = 7.22 Hz, 2H), 2.72 (q, J= 7.33 Hz. 2H), 1.42 (s, 9H), 1.29 (t. 
J= 7.33 Hz. 3H). ESI-MS: Calcd. for C22H27NO4S2 [M+Naf 456.0 Found: 456.0. 

[0140] To a 250 mL round-bottomed flask equipped with a magnetic stir 

bar was added the above phenylalanine derivative (3.0 g, 6.9 mmol) as well as a 
Q jT^ 4m solution of HCl in dioxane (86 mL, 345 mmol) and the reaction was stirred 
10 sEt under argon at room temperature for 1.5 hr. At that point the reaction was 
concentrated in vacuo, leaving a white solid. This material was triturated with ether and 
subsequently concentrated. This process was repeated (3x) leaving the crude product as a white 
soUd. The material was dissolved in 30% B, shell frozen and lyophilized to give a white powder 
(2.17 g, 94%). 'H NMR (CD3OD. 400 MHz): 8 7.93-7.95 (m. IH), 7.52-7.42 (m, 7H), 7.23-7.21 
(m,lH), 4.81 (dd. J = 8.35, 5.73 Hz, IH), 3.56 (abx. Jab = 14.48 Hz, Jox = 5.73 Hz, J&c = 8.35 Hz. 
2H). 2.85 (q, J = 7.34, 2H), 1.37 (t, J= 7.34, 3H). '^C NMR (CDCI3, 125 MHz): 5 169.0, 149.4. 
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135.6, 131.9, 131.3, 131.0, 130.7, 129.9, 129.6, 129.1. 123.9, 55.6, 38.0, 34.1, 14.9. ESI-MS: 
Caicd. for C17H19NO2S2 [M+H]* 334.1 Found: 334.2. 

[01411 The phenylalanine derivative 10 (15 mg, 45 \imo\) and L- 

H « cysteine (6 mg, 49 nmol) were placed into a LCMS vial along with a flea-sized 
stirbar. In a second vial were mixed MES-Na (25 mg, 150 nmol) and 
phosphate buffered saline (PBS) (0.2M NaCl, 0.2M phosphate, pH = 7.5, 2 mL). The MES-Na 
solution was then added directly to the amino acids, and the reaction was monitored by LCMS. 
After two hours the reaction appeared to be complete and TCEP (129 mg, 450 nmol) was added. 
This was stirred for 1 hour and then injected directly onto the HPLC for purification. The 
desired compound was obtained as a white powder after lyophilization (9.4 mg, 78%). *H NMR 
(D2O): 8 7.36-7.45 (m, 3H), 7.31-7.33 (m, 2H), 4.63 (dd, J = 6.61, 4.83 Hz, IH), 4.34 (dd, 7 = 
7.24, 7.18 Hz, IH), 3.25 (abx. Jab = 14.1 Hz, Jax = 7.18 Hz, J^x = 7.24 Hz, 2H), 2.96 (abx. Jab = 
14.2 Hz, J« = 4.83 Hz, Jfcx = 6.61 Hz, 2H). LCMS: 5-65%B over 20 min, rt = 8.53 mm. HPLC: 
5-95%B over 30 min, rt = 11.45 mm. ESI-MS: Calcd. for CuHisNzOaS [M+H]"^ 269.1, Found: 
269.1 

[0142] Exemplary ligation conditions: 

[0143] The two glycopeptide halves (12, 2.2 mg, 1.44 »miol) (18, 1.4 mg, 1.44 ^mol) 

were placed in a LCMS vial along with a flea-sized stirbar. A stock solution of MESNa 
(18.3 mg. 111 mmol) in phosphate buffered saline (0.2M NaCl, 0.2M phosphate, 
pH = 7.4, 1 mL) was made and of this, 600 \iL was added to the glycopeptides. The reaction was 
monitored by LCMS and, once finished, TCEP (25 mg, 0.087 mmol) was added and the solution 
stirred for 2 hr then injected directly onto the HPLC for purification. 


HO— ^ .>.. 


/ '■ r 
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[0144] LCMS: 5-65%B over 20 min, rt = 9.98 min. HPLC: 25-55%B over 30 min, 

rt = 7.65 min. ESI-MS: Galcd. for C93H,38N24037S [M+2H]^" 1108.5, Foimd: 1108.6, [M+3H]^" 
739.3, Found: 739.5. 


AcHN^ K 


J " O ^ O O >sg^ O i 


[0145] LCMS: 5-65%B over 20 min, rt = 11.40 min. HPLC: 5-65%B over 20 min, 

rt = 9.85 min. ESI-MS: Calcd. for C73H,2oN2203oS [M+2H]^* 909.4, Found: 909.5, [M+3H]^ 
606.6, Found: 606.8. 

[01461 LCMS: 5-65%B over 20 min, rt = 4.02 min. HPLC: 5-65%B over 20 min, rt = 

6.71 min. ESI-MS: Calcd. for C73H120N22O30S [M+2H]^* 898.4, Found: 898.6. 


^NH2 HN^NHa 


HO^OH 
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[0147] LCMS: 5-65%B over 20 min. rt = 7.81 min. HPLC: 5-65%B over 20 min, rt = 

8.39 min. ESI-MS: Calcd. for C98H,6oN2405iS [M+2H]^" 1261.5, Found: 1261.5, 
841.4, Found: 841.5. 



[0148] LCMS: 5-45%B over 20 min, rt = 10.82 min. HPLC: 5-45%B over 20 min, 

rt= 9.31 min. ESI-MS: Calcd. for C98H,6oN2405iS [M+2Hr 1504.6. Found: 1504.9, [M+3H]'* 
1003.4, Found: 1003.7. 

Abbreviations and Glossary 


[0149] 

A: alanine 

[0150] 

Ac: acetyl 

[0151] 

ACT: al-antichymotrypsin 

[0152] 

Ala: alanine 

[0153] 

Arg: ar^Jiine 

[0154] 

Asn: aspaiagine 

[0155] 

Asp: aspartic acid 

[0156] 

Bn: benzyl 

[01571 

Boc: /erf-butyloxycarbonyl 

[0158] 

BPH: benign prostatic hyperplasia 

[0159] 

BSP: benzenesulfinyl piperidine 

[0160] 

Bu: butyl 

[0161] 

Bz: benzoyl 

[0162] 

CAN: eerie ammonium nitrate 
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[0163] coll: jrym-coUidine 

[0164] C-tenninus: peptide carbonyl terminus 

[0165] Cys: cysteine 

[0166] D: aspartic acid 

[0167] DIEA: iV^-diisopropylethylamine 

[0168] DMAP: A^^-Dimethylaminopyridine 

[0169] DMF: dimethyl fonnamide 

[0170] DMSO: dimethyl sulfoxide 

[0171] DTBMP: di-/err-butylmethylpyridine 

[0172] DTBP: di-ter/-butylpyridine 

[0173] Et: ethyl 

[0174] Fmoc: 9-fluorenylmethyloxycarbonyl 

[0175] G: glycine 

[0176] Gal: galactose 

[0177] Glc: glucose 

[0178] G]n: glutamine 

[0179] Glu: glutamic acid 

[0180] Gly: glycine 

[0181] H: histidine 

[0182] HATU: 7-azahydroxybenzotriazolyl tetramethyluronium hexafluorophosphate 

[0183] His: histidine 

[0184] He: isoleucine 

[0185] K: lysine 

[0186] kDa: kilodaltons 

[0187] KLH: keyhole limpet hemocyanin 

[0188] L: leucine 

[0189] Leu: leucine 

[0190] Lys: lysine 

[0191] Man: mannose 

[0192] MES-Na: 2-mercaptoethanesulfonic acid, sodium salt 

[0193] MHC: major histocompatibility complex 
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[0194] 

N: asparagine 

[0195] 

NAc: iV-acetyl 

[0196] 

NCL: native chemical ligation 

[0197] 

N-terminus: peptide amine terminus 

[0198] 

O-linked: linked through an ethereal oxygen 

[01991 

Pam3Cys:tripaknitoyl-S-glycerylcysteinylserine 

[0200] 

PBS: phosphate-buffered saline 

[0201] 

Ph: phenyl 

[0202] 

Phth: phthalimido- 

[0203] 

PMB: />-methoxybenzyl 

[0204] 

Pro: proline 

[0205] 

GP120: prostate specific antigen 

[0206] 

Py: pyridine 

[0207] 

QS21: a glycosteroidal immunoaduvant 

[0208] 

R: arginine 

[0209] 

S: serine 

[0210] 

sat. aq.: saturated aqueous 

[0211] 

Set: serine 

[0212] 

T: threonine 

[0213] 

TBAF: : tetra-w-butylammonium fluoride 

[0214] 

TBS: fert-butyldimethylsilyl 

[0215] 

/Bu: tert-butyl 

[0216] 

Tf: trifluoromethanesulfonate 

[0217] 

TFA: Trifluoroacetic acid 

[0218] 

THF: tetrahydrofiiran 

[0219] 

Thr: threonine 

[0220] 

t-GP120: total prostate specific antigen 

[0221] 

TMSCl: Trimethiylsilyl chloride 

[0222] 

Trp: tryptophan 

[0223] 

V: valine 

[0224] 

Val: valine 
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[02251 W: tryptophan 
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Abstract 

The present invention provides a method for preparing polyfunctionalized peptides and/or 
proteins at non-adjacent designated sites via native chemical ligation. In certain embodiments, 
the inventive method is a method for preparing a polyfunctionalized peptide comprising a 
peptidic backbone made up of four or more amino acids, wherein two or more non-adjacent 
amino acids are independently substituted with a moiety having the structure: 


H-'-l. 


wherein A and L' are as defined herein. 

In certain other embodiments, the inventive method allows the preparation of 
polyfunctionalized peptides having the general structure: 


0 


rPO 


L R"^' J 


wherein A, R™, R"*', R'", tO, s, t and q are as defined herein. 
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